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(57) ABSTRACT 

A hierarchical document comparison system comprises a 
document server computer remotely accessible by a user 
computer. The document server computer includes a data- 
base which stores the contents of a first document and a 
second document. The first document contains one or more 
segments and the second document contains one or more 
segments. Each segment is identified by a segment identifier. 
The document server computer also includes a module 
executable in the document server computer. The module is 
configured to receive a request to compare the first docu- 
ment and the second document. The module compares the 
first document and the second document and identifies the 
segments in the documents containing differences. The 
module then displays on the user computer a side-by-side 
display listing the segment identifiers for the identified 
segments. The segment identifiers for the identified seg- 
ments in the first document containing differences are dis- 
played in a first list and the segment identifiers for the 
identified segments in the second document containing 
differences are displayed in a second list. 

22 Claims, 9 Drawing Sheets 
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HIERARCHICAL DOCUMENT 
COMPARISON SYSTEM AND METHOD 



BACKGROUND 

1. Field 

The present invention relates generally to the field of 
document comparison. More particularly, the invention 
relates to a system and method for the storage, retrieval, and 
comparison of documents. 

2. Description of the Related Art 

The advent of text processing application programs have 
enabled the computer to become a viable tool for document 
creation. A user is able to develop a document by entering 
the text comprising the document into the computer using 
such an application program. Typically, the document con- 
tents are stored on the computer in what is known as a file. 
The user is able to subsequently make modifications to the 
document by recalling the stored file and making the desired 
changes. The user can then save the subsequent version of 
the document containing the modifications as the same file 
or a different file. Saving the document containing the 
modifications as a different file enables the user to create 
multiple versions of the document. 

In the course of creating a document, it may be necessary 
or desirable for the user to compare two versions of the 
document. One method of comparing the different versions 
was to create a printout of the files containing the appropri- 
ate versions of the document and visually comparing the two 
printed files word-by-word and noting the changes. This 
process was labor intensive and very time consuming, 
especially for large documents. The user still had to labo- 
riously scroll through the two files and note the changes. 

Commercially available word processing programs such 
as Word 97, from Microsoft Corporation, and WordPerfect 
version 8.0, from WordPerfect Corporation, include a docu- 
ment compare feature that enables the comparison of a 
document currently displayed on the screen with a document 
stored on the disk as a file. The text which exist in the 
document stored on the disk but not in the currently dis- 
played document is copied and inserted into the appropriate 
position in the currently displayed document and indicated 
by strikeout codes such as a "redline" through the text. The 
text which exists in the currently displayed document but not 
in the document stored on the disk is indicated accordingly, 
for example, by underlining the text. The user of these word 
processing programs still has to inspect the content of the 
entire document for the changes. 5Q 

SUMMARY 

The present invention hierarchically compares one or 
more documents stored on a computer. The hierarchical 
comparison allows a user to efficiently identify and view 55 
only those segments within the documents that are different 

In one preferred embodiment, a document server com- 
puter stores one or more documents in a document database. 
Additionally, the documents may advantageously be 
grouped into one or more document types or categories of 60 
documents. A category of document may advantageously be 
a logical grouping of types of documents including one or 
more versions of individual documents. The document 
server computer includes one or more web pages which axe 
accessible by one or more users over a communication 65 
medium. The web pages enable a user to remotely request a 
comparison between two documents. The documents stored 
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40 
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in the document database include one or more segments. A 
document segment is an identifiable portion of a document. 
For example, a segment may be a chapter, section, 
subsection, page, and the like. In one embodiment, the 
document database stores the document separated into its 
document segments. 

A user utilizes a web browser executing on his or her user 
computer to connect to the document server computer. Once 
connected, the user can access the web pages and request a 
comparison between two documents. In one embodiment, 
the document server computer requests that the user specify 
a category of document to compare. The document server 
computer may also request a user password to ensure that the 
user has authorization to access the requested category of 
document. Having verified the password, the document 
server computer lists the documents contained in the 
requested category of documents. 

In one embodiment, the most recent prior version of each 
of the documents may advantageously be listed in one list 
and the current version of the documents may advanta- 
geously be presented in another list. The user can then select 
a first document from one list and another document from 
the other list and request the document server computer to 
compare the selected documents. Substantially similar or 
like segments in the selected documents are compared, and 
segments containing differences or changes are identified. 
Upon detecting the first difference or change, the compari- 
son is stopped for that segment, and the comparison pro- 
ceeds to another segment. If a particular segment in one 
document does not have a substantially similar or like 
segment in the other document, the particular segment is 
identified as containing differences or changes. The seg- 
ments identified as containing modifications or changes are 
listed in a side-by-side display on the user computer. One 
side of the display advantageously lists the identified seg- 
ments from the first document and the other side of the 
display advantageously lists the identified segments from the 
second document. Ordinarily, the selected documents would 
be the current version and the immediately prior version of 
the same document. However, the user may select any 
document from the first list to be compared with any selected 
document from the second list. 

The user can then select a first segment from one list and 
a second segment from the other list and request the docu- 
ment server computer to compare the selected segments. The 
document server computer compares the selected segments 
identifying the differences or changes between the two 
segments. Differences or changes between the selected seg- 
ments may advantageously be identified at two levels. 
Components in the segment containing the differences or 
changes are appropriately indicated as well as the actual 
elements or subcomponents which are different. For 
example, a word contained in a sentence may be different 
between the selected segments. The document server com- 
puter can then identify the word as a unit or subcomponent 
containing differences or changes, and accordingly identify 
the sentence containing the changed word as the component 
containing differences or changes. The document server 
computer then displays the entire contents of each of the 
selected segments in a side -by-side display on the user 
computer. The identified components and subcomponents 
are distinguished in the side-by-side display of selected 
segment contents so that the user can quickly identify the 
changes that have been made to the segment contents. 

The document server computer advantageously performs 
a document comparison in manageable hierarchies or stages. 
In one embodiment, a request to compare a category of 
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document presents a list of documents in the requested FIG. 9 is a representation of one embodiment of a 

category of documents. A subsequent document comparison document database of the invention, 
presents a list of segments in the document that contain 

differences or changes. A further segment comparison pre* DETAILED DESCRIPTION 

sents the contents of the segment with the actual differences 5 A document server facilitates a hierarchical, side-by-side 

or changes, and the neighborhood of the actual differences comparison of documents over a communication medium. A 

or changes, appropriately indicated for clarity and identifi- firsl document is compared to a second document in 

cation, hierarchies, and the result of the hierarchical comparison is 

Then, for example, a user can easily and efficiently presented in a side-by-side display. As an example, the 

observe changes made to a set of proposed government to complete document may advantageously be considered the 

regulations including thousands of pages of regulations and top hierarchy. A segment of the document, such as a section 

comments, or an agreement containing a large volume of or subsection within the document, can then be considered 

exhibits with specifications without having to view and the next hierarchy. A component of the document segment, 

scroll through the entire document. Hie proposed regulation such as a paragraph or a sentence, can be considered the next 

or the agreement may contain one or more volumes and each is lower hierarchy. Finally, subcomponents of the components, 

volume may additionally contain thousands of sections and such as a word or a character, may advantageously be 

thousands of pages. Changes to the proposed regulations or considered the lowest hierarchy. Differences at each hierar- 

agreements are made to a copy, or another version, stored on chy are identified and presented in a side-by-side manner, 

the document server computer. The regulations and agree- i n one embodiment of the invention, the document server 

ments may be subject to review by one or more authorized 20 deludes a document database which stores the documents in 

users, and the users can efficiently and conveniently identify electronic form. Typically, multiple versions of the docu- 

and observe the changes made to the regulations and agree- ments are stored in the document database, and a user 

ments. The regulation or the agreement may be compared in compares different versions of a document. The document 

hierarchies. At each hierarchical level, each segment of the server further comprises a web server and an interface 

document (which has been split up in accordance with 25 mo fc\ t which enables access to the electronic contents 

natural divisions in the document, such as sections) that has stored in the document database. The web server provides 

been changed is identified and presented to the user. From connectivity to a communication medium such as the Inter- 

this list of segments which contain content changes, the user ne t and the World Wide Web (WWW). The document server 

can easily and conveniently focus in on a particular portion includes a plurality of HTML page definitions (hereinafter 

of the regulation or agreement that has been changed. The 30 re f erre d to as HTML pages or web pages) accessible over the 

specified portion of the regulation or agreement of interest is communication medium through the web browser, 

further compared. Ultimately, the contents of the portion of A ^ mi]izes a browser executing on a user computer to 

the regulation or agreement of interest are fully displayed to access the HTML pages on the document server. Using the 

the user. The actual changes, as well as the neighborhood of HTML pages, the user submits identifying information to 

the changes, are further distinguished in this display for ease 35 ^ access tQ ^ documents stQred in the doC ument data- 

of identification. ^he user may then select a particular document or 

BRIEF DESCRIPTION OF THE DRAWINGS documents and request the document server to perform 

m , , , , „ certain operations such as, by way of example, comparing 

Tnese and other aspects, advantages, and novel features of me selected documentSj displaying the selected document, 

the invention will become apparent upon reading the fol- 40 ^ ^ ^ documeQt Furthermore) 

lowing detailed description and upon reference to the because the document server is remotely accessible over the 

accompanying drawings in which: communication medium, multiple users located at differing 

FIG. 1 is a system block diagram illustrating an embodi- geographical locations can access the document server and 

menl of the overall network architecture; concurrently perform operations on the same document or 

FIG. 2 is a high level block diagram illustrating one 4 documents, 

embodiment of the selected components of a user computer Even lnoug h lne invention is suitable for hierarchically 

architecture and a document server computer architecture; comparing any number of documents or other electronic 

FIG. 3 is a block diagram illustrating one embodiment of content stored on the document server at one time, the 

a flow of information between a user computer and a 5g invention will be further disclosed in the context of bierar- 

document server computer through a communication chically comparing two versions of a document and pre- 

medium; seating the results in a side-by-side display. Throughout the 

FIG, 4 is a flowchart illustrating a process by which a user drawings, components which correspond to components 

requests the comparison of two documents in accordance shown in previous figures are indicated using the same 

with one embodiment of the present invention; 55 reference numbers. 

FIG. 5 is a flow chart illustrating a process by which two One network architecture suitable for use with one pre- 

documents are compared in accordance with one embodi- ferred embodiment of the invention is indicated generally by 

ment of the present invention; a system 10 in FIG. 1. The system 10 includes a plurality of 

FIG. 6 is a representation of one embodiment of an HTML user computers 102 and a document server computer 104 

page illustrating a side-by-side display of a list of docu- 60 which communicate with each other by use of a communi- 

ments; cation medium 106. Alternatively, the system 10 may 

FIG. 7 is a representation of one embodiment of an HTML include one or more document server computers 104. 

page illustrating a side-by-side display of a list of segments A computer, including the computers 102 and 104, may be 

of two documents containing differences; any microprocessor or processor (hereinafter referred to as 

FIG. 8 is a representation of one embodiment of an HTML 65 processor) controlled device that permits access to the 

page illustrating a side-by-side display of the contents of two communication medium 106, including terminal devices, 

segments-of two documents; and such as personal computers, workstations, servers, mini 
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computers, main-frame computers, laptop computers, a net- In one embodiment, the Internet includes a Domain Name 
work of individual computers, mobile computers, palm top Service (DNS). As is well known in the art, the Internet is 
computers, hand held computers, set top box for a TV, an based on Internet Protocol (IP) addresses. The DNS trans- 
interactive television, an interactive kiosk, a personal digital lates alphabetic domain names into IP addresses, and vice 
assistant, an interactive wireless communications device, or 5 versa. The DNS is comprised of multiple DNS servers 
a combination thereof. The computers may further possess situated on multiple networks. In translating a particular 
input devices such as a keyboard or a mouse, and output domain name into an IP address, multiple DNS servers may 
devices such as a computer screen or a speaker. be accessed until the domain name translation is accom- 
Furtbermore, the computers may serve as clients, servers, or plished. 

a combination thereof. 10 One part of the Internet is the World Wide Web (WWW). 

These computers may be uniprocessor or multiprocessor The WWW is generally used to refer to both (1) a distributed 

machines. Additionally, these computers include an addres- collection of interlinked, user-viewable hypertext docu- 

sable storage medium or computer accessible medium, such ments (commonly referred to as "web documents" or "web 

as random access memory (RAM), an electronically eras- pages" or "electronic pages" or "home pages") that are 

able programmable read-only memory (EEPROM), pro- 15 accessible via the Internet, and (2) the client and server 

grammable read-only memory (PROM), erasable program- software components which provide user access to such 

mable read-only memory (EPROM), hard disks, floppy documents using standardized Internet protocols. The web 

disks, laser disk players, digital video devices, compact documents are encoded using Hypertext Markup Language 

disks, video tapes, audio tapes, magnetic recording tracks, (HTML) and the primary standard protocol for allowing 

electronic networks, and other techniques to transmit or 2 n applications to locate and acq uire w eb documents is the 

store electronic content such as, by way of example, pro- Hypertext Transfer Protocol (HTTP). However, the term 

grams and data. In one preferred embodiment, the computers WWW is intended to encompass future markup languages 

are equipped with a network communication device such a and transport protocols w hich m ay be used in place of, or in 

network interface card, a modem, or other network connec- addition to, HTML and HTTP. 

tion device suitable for connecting to the communication 2 s The WWW contains different computers which store 

medium 106. Furthermore, the computers execute an appro- electronic pages, such as HTML documents, capable of 

priate operating system such as Unix, Microsoft® Win- displaying graphical and textual information. The document 

dows® 3.1, Microsoft® Windows® 95, Microsoft® Win- server computer 104 which provides information, for 

dows® NT, Apple® MacOS®, or IBM® OS/2®. As is example, the electronic content to be hierarchically 

conventional, the appropriate operating system includes a 30 compared, on the WWW is generally referred to as a 

communications protocol implementation which handles all "website." A website is defined by an Internet address, and 

incoming and outgoing message traffic passed over the the Internet address has an associated electronic page, 

communication medium 106. In other embodiments, while Generally, an electronic page may advantageously be a 

the operating system may differ depending on the type of document which organizes the presentation of text, graphi- 

computer, the operating system will continue to provide the 35 cal images, audio and video. 

appropriate communications protocols necessary to estab- One of ordinary skill in the art will recognize that the 

lish communication links with the communication medium communication medium 106 may advantageously be com- 

106. prised of other types of networks without detracting from the 

The computers may advantageously contain program scope of the invention. The communication medium 106 can 

logic, or other substrate configuration representing data and 40 include, by way of example, local area networks (LANs), 

instructions, which cause the computer to operate in a wide area networks (WANs), public internets, private 

specific and predefined manner as described herein. In one intranets, a private computer network, a secure internet, a 

embodiment, the program logic may advantageously be private network, a public network, a value-added network, 

implemented as one or more modules. The modules may interactive television networks, wireless data transmission 

advantageously be configured to reside on the addressable 45 networks, two-way cable networks, interactive kiosk 

storage medium and configured to execute on one or more networks, and the like. 

processors. The modules include, but are not limited to, In addition to the Internet, the communication medium 
software or hardware components which perform certain 110 may advantageously include network service providers 
tasks. Thus, a module may include, by way of example, that offer electronic services such as, by way of example, 
components, such as, software components, object-oriented 50 Internet Service Providers (hereinafter referred to as ISP), 
software components, class components and task An ISP or other network service provider may advanta- 
components, processes methods, functions, attributes, geously support both dial-up and direct connection in pro- 
procedures, subroutines, segments of program code, drivers, viding access to various types of networks. An ISP can be a 
firmware, microcode, circuitry, data, databases, data computer system which provides access to the Internet, 
structures, tables, arrays, and variables. 55 Generally, the ISP is operated by an ISP company. Examples 
The communication medium 106 may advantageously of ISP companies include America On-line, the Microsoft 
facilitate the transfer of electronic content. In one Network, Network Intensive, and the like. Typically for a 
embodiment, the communication medium 106 includes the fee, these ISP companies provide a user a software package, 
Internet. The Internet is a global network connecting mil- usemame, password, and access phone number. Using this 
lions of computers. The structure of the Internet, which is 60 information, the user can then use the user computer 102 to 
well known to those of ordinary skill in the art, is a global connect to the ISP and access the Internet. Those of ordinary 
network of computer networks utilizing a simple, standard skill in the art will realize that the ISP is optional and a 
common addressing system and communications protocol computer can advantageously execute software programs 
called Transmission Control Protocol/Internet Protocol providing direct access to the Internet. In this instance, the 
(TCP/IP). The connection between different networks are 65 computer may be connected directly to the Internet, 
called "gateways", and the gateways serve to transfer elec- In one embodiment, a user utilizes the user computer 102 
tronic data worldwide. and communicates with the document server computer 104 
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by accessing the communication medium 106. For example, 
the user invokes a browser which executes on the user 
computer 102. The browser is further discussed below. The 
browser, in turn, establishes a communication link to the 
Internet directly or through an ISP. Once connected to the 
communication medium 106, the user can direct the browser 
to access the information made available on the document 
server computer 104. 

The document server computer 104 includes a plurality of 
web pages which facilitate the comparison of electronic 
contents stored on the document server computer 104. As an 
example, one or more versions of a document are stored 
electronically on the document server computer 104. The 
document comparison is performed between one or more 
versions of substantially similar or like documents. The 
electronic contents may be information such as, by way of 
example, text, graphics, video, audio, and other forms of 
information capable of being stored on a computer. In one 
embodiment, the documents may advantageously include 
text or graphical information. In another embodiment, the 
documents may advantageously be comprised of audio 
information or video information. In still another 
embodiment, the documents can include any combination of 
the aforementioned forms of electronic content. 

While communicating with the document server com- 
puter 104, the user is presented with a web page (a window 
displayed on the user computer 102) from which the user can 
remotely request a comparison of a particular type of 
document or category of document stored on the document 
server computer 104, A type of document or category of 30 
document may advantageously be a logical subject matter 
grouping of documents including one or more versions of 
each document (each version of which will advantageously 
be stored as a document in the system). The document server 
computer 104 can store one or more categories of documents 
or types of documents. By way of example, a first version 
and a second version of the FAA documents as well as a first 
version and a second version of the Department of Interior 
(DOI) documents may be stored on the document server 
computer 104. The FAA documents may advantageously be 
one category of document and the DOI documents may 
advantageously be another category of document. Typically, 
a category of document, such as the FAA documents, can 
consist of hundreds of documents. These documents may 
include, for example, the following documents: "Standards 
and Recommended Practices" and "Digital Voice Recorder." 
Furthermore, each document may be very large, consisting 
of thousands of pages. The thousands of pages can addi- 
tionally be grouped into hundreds or even thousands of 
segments which the document may naturally be divided into. 
For example, segments may advantageously be sections and 
subsections actually used and labeled in each document 
(hereinafter, segment, section, and subsection are used 
interchangeably). 

A user such as an FAA agency official responsible for 
maintaining and tracking the changes from one version of 
the FAA documents to a subsequent version of the FAA 
documents may advantageously benefit by quickly and 
easily observing changes made between the differing ver- 
sions of the documents. In another embodiment, a lobbyist 
could quickly implement or modify strategies by easily and 
efficiently tracking the changes made to the FAA documents 
without having to pour through the thousands of pages. 

Having selected a category of document for comparison, 
for example, the FAA documents, the user, such as an FAA 
agency official, may be required to provide further identi- 
fying information, such as a password, to the document 
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server computer 104. The document server computer 104 
authenticates the user provided identifying information to 
ensure that the documents in the requested category of 
document is accessed only by authorized users. Having 
authenticated the user information, the document server 
computer 104 lists the documents contained in the requested 
category of document. Continuing the above FAA example, 
the documents contained in the FAA category of document, 
including the "Standards and Recommended Practices" 
document and the "Digital \foice Recorder" document, are 
presented to the FAA agency official in a selectable list. In 
one embodiment, the documents are presented in a side -by- 
side display. One display lists one version of the FAA 
documents and the other display lists another version of the 
FAA documents. 

The user can then select and submit a document contained 
in the requested category of document for comparison by the 
document server computer 104. Continuing the FAA 
example, the FAA agency official can select the "Standards 
and Recommended Practices'* document from the selectable 
list for comparison by the document server computer 104. In 
one embodiment, the "Standards and Recommended Prac- 
tices" document may be a telecommunications specification 
for international aviation defining the standards for the 
Aeronautical Telecommunication Network ("ATN"). 

Having received a user selected document for 
comparison, the document server computer 104 compares 
differing versions of the segments contained in the requested 
document. By way of example, a document may contain the 
following sections: 

Section A 
Section A.l 

[text] 
Section A.2 
[text] 

Section B 
[text] 

Section C 
[text] 

For the above document, the identifiable portions or seg- 
ments of the document may advantageously be the sections 
containing text. For example, the segments may advanta- 
geously be "Section A.1," "Section A.2," "Section B " and 
"Section C." Then in comparing the document, the segment 
"Section A.l" in a first version of the document is compared 
to a "Section A. 1" in a second version of the document. Both 
the first and the second versions of the document are stored 
on the document server computer 104. When a first modi- 
fication or change is detected in comparing a document 
segment, the segment is identified as containing a change, 
the comparison of that particular segment ceases, and the 
comparison of another segment of the document com- 
mences. Subsequently, the document server computer 104 
presents a list of segments which contain differences or 
changes in a side-by-side display. For example, if "Section 
A.l" is identified as being changed between the compared 
versions, the side-by-side display advantageously lists "Sec- 
tion A.l" as being changed. In another embodiment, a 
segment may advantageously be delineated at the section 
level, for example, by the sections "Section A," "Section B," 
and "Section C" above, contained in a document. In still 
another embodiment, the segment may be any identifiable 
portion of the document such as a chapter or a page. 

Continuing the FAA example, the FAA agency official 
may advantageously select and submit the "Standards and 
Recommended Practices" document for comparison by the 
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document server computer 104. Subsequently, the document 
server computer 104 compares the current version with the 
most recent older version on a segment by segment basis. 
For example, the "Standards and Recommended Practices" 
document may include thousands of segments including two 5 
segments entitled "Routing of Messages" and "Security on 
the Internet." One version of each segment of the "Standards 
and Recommended Practices" document is compared with 
the next earlier version of the substantially similar or like 
segment. In the FAA example, one version of the "Routing io 
of Messages" segment is compared to another version of the 
"Routing of Messages" segment. If there is a modification or 
change in the segment, the segment is identified as contain- 
ing a difference between versions. Once all the segments in 
the "Standards and Recommended Practices" document is 
have been compared, the identified segments are listed in a 
side-by-side display. For example, both the "Routing of 
Messages" segment and "Security on the Internet" segment 
may be listed in the side-by-side display as containing 
differences between version. One display lists one version of 20 
the identified segments and the other display lists the other 
version of the identified segments. If no change has been 
made within a segment, that segment is not listed. 

Subsequently, the user may advantageously select and 
submit one segment from the first list and another segment 25 
from the second list in the side-by-side display for compari- 
son by the document server computer 104. The document 
server computer 104 then advantageously compares the 
contents of the submitted segments and identify 
components, such as, by way of example, a paragraph or a 30 
sentence, within the segments that are different. The con- 
tents of the submitted segments are presented to the user in 
a side-by-side display. One display shows the contents of 
one segment and the other display shows the contents of the 
other segment. The identified components within the seg- 35 
meat which contain differences are presented in a manner 
which distinguishes it from the rest of the contents of the 
displayed segment. For example, the components which 
remain the same between the submitted segments can be 
presented in one color, such as blue, while the components 40 
which are different between the submitted segments can be 
presented in a different color, such as red. In another 
embodiment, the contents of one submitted segment may be 
presented in a first color, such as green, while the contents 
of the other submitted segment may be presented in a second 45 
color, such as white, to further distinguish the contents of 
one segment from the contents of the other segment. 

Within the identified components, the actual 
subcomponent, such as , by way of example, the word, letter, 
pixel, byte, or bit, which is different between the submitted 50 
segments may advantageously be further identified and 
distinguished in the display of the segment contents. The 
identified subcomponent can be further distinguished in the 
side-by-side display of segment contents by means such as 
underlining or holding. For example, in one display, the 55 
sentence containing a word which is different can be dis- 
played in red, and the actual word which is different can be 
further underlined. The subcomponent is the actual content 
of the segment which is different while the component is the 
"neighborhood" containing the subcomponent. 60 

Continuing the FAA example, the FAA agency official 
may select and submit the "Routing of Messages" segment 
from both lists in the side-by-side display. Subsequently, the 
document server computer compares the first version of the 
"Routing of Messages" segment to the second version of the 65 
"Routing of Messages" segment. The components of the 
segments containing differences are identified. The subcom- 
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ponents within the identified components which are actually 
different are also identified. The contents of the first version 
of the "Routing of Messages" segment is displayed in one 
side of the side-by-side display. The contents of the second 
version of the "Routing of Messages" segment is displayed 
in the other side of the side-by-side display. The sentences 
which have changed between the two versions may advan- 
tageously appear in a different color or font than the reset of 
the contents of the "Routing of Messages" segment. The 
actual words within the sentences which are different may 
advantageously be further underlined or bolded in the dis- 
play. 

In another embodiment, selecting a category of docu- 
ments to compare results in the documents server computer 
104 displaying documents in the selected category of docu- 
ments which have changed between the prior and current 
versions of the document in a side-by-side display. For 
example, the document server computer 104 can compare 
the time stamps of the document, or the size of the 
document, as stored on the document server computer 104 
to determine if a document has changed between versions. 
In this embodiment, the user is only presented with docu- 
ments containing differences. 

One benefit of the invention is that it permits users to 
compare very large documents in stages or hierarchies. At 
the initial or first hierarchy is the type of documents or 
category of documents. Each type of documents or category 
of documents is an element or node in the particular hier- 
archy. In the FAA example above, the FAA category of 
documents can advantageously be one node in the first 
hierarchy. Another node in the top hierarchy would be, for 
example, the DOI documents. The documents contained in 
the category of documents compose the next or second 
hierarchy. Each document is an element or node in this 
particular hierarchy. In the FAA example, each one of the 
hundreds of documents, including the "Standards and Rec- 
ommended Practices" Document and the "Digital Voice 
Recorder*' document, can advantageously be one node in the 
second hierarchy. The segments contained in a document 
compose the third hierarchy. In the FAA example, each one 
of the thousands of segments, including the "Routing of 
Messages" segment and the "Security on the Internet" 
segment, can advantageously be one node in the third 
hierarchy. The components which are different when com- 
paring one or more submitted segments are elements con- 
tained in a fourth hierarchy. The subcomponents contained 
in the components are elements in a fifth hierarchy. For 
example, each sentence (component) identified as being 
different between two submitted segments can advanta- 
geously be considered one node in the fourth hierarchy. 
Furthermore, each word (subcomponent) identified as being 
different between the two submitted segments can advanta- 
geously be considered one node in the fifth hierarchy. 

Thus, the invention advantageously compares documents, 
including very large documents, in a layered or hierarchical 
manner. In one embodiment, requesting a comparison of a 
particular category of documents (an element in the first 
hierarchy) results in a listing of the documents contained in 
the requested category of documents (the listing of elements 
in the second hierarchy). Requesting a comparison of a listed 
document (an element in the second hierarchy) results in a 
listing of the segments contained in the requested document 
(elements in the third hierarchy) containing differences. 
Requesting a comparison of one or more segments (one or 
more elements in the third hierarchy) results in the display 
of segment contents with the components (elements in the 
fourth hierarchy) and the subcomponents (elements in the 
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fifth hierarchy) being further distinguished in the display for 
ease of identification. Those of ordinary skill in the art will 
realize that there may be additional hierarchies or fewer 
hierarchies without detracting from the hierarchical com- 
parison element of the present invention. 

FIG. 2 illustrates in more detail selected components of 
the user computer 102 and the document server computer 
104 of FIG. 1 suitable to implement one embodiment of the 
present invention. The user computer 102 includes a browser 
202. The document server computer 104 includes a web 
server 204, an interface module 206, and a document 
database 208. The depicted components may advanta- 
geously communicate with each other and other components 
comprising the respective computers through mechanisms 
such as, by way of example, interprocess communication, 
remote procedure call, and other various program interfaces. 
Furthermore, the functionality provided for in the 
components, modules, and databases may be combined into 
fewer components, modules, or databases or further sepa- 
rated into additional components, modules, or databases. 
Additionally, the components, modules, and databases may 
advantageously be implemented on one or more computers. 

The browser 202 is a software program which allows a 
user to access different computers, including the document 
server computer 104, through the communication medium 
106. In one preferred embodiment, the browser 202 may be 
a standard browser such as the Netscape® Navigator devel- 
oped by Netscape, Inc. or the Microsoft® Internet Explorer 
developed by Microsoft Corporation. One of ordinary skill 
in the art will realize that other types of access software 
could also be used to implement the browser 202. The other 
types of access software could be, by way of example, other 
types of Internet browsers, custom network browsers, two- 
way communications software, cable modem software, 
point-to-point software, custom emulation programs, and 
the like. 

A user utilizes the browser 202 to access the document 
server computer 104, and more particularly, the web pages 
which facilitate the hierarchical side-by-side comparison of 
documents, and requests a document comparison be per- 
formed by the document server computer 104. One embodi- 
ment of a process by which a user requests a comparison of 
two documents is illustrated by the flow chart in FIG. 4. 
Beginning in a start state 400, the user initiates the execution 
of the browser 202 on his or her user computer 102. The user 
directs the user computer 102, utilizing the browser 202, to 
establish a communications link or network connection to 
the document server computer 104 through the communi- 
cation medium 106. Having successfully established the 
network connection, the user is presented with a first web 
page stored on the document server computer 104 in state 
402. In particular, the browser 202 displays the first web 
page which provides the user the capability to select a 
category of document to be compared by the document 
server computer 104. In one embodiment, a list of categories 
of documents stored on the document server computer 104 
that can be compared may advantageously be presented to 
the user through a pull-down menu. From the list of the 
categories of documents, the user can use a pointing device, 
such as a mouse or the like, and select the desired category 
of document to compare. In another embodiment, the first 
web page may contain a data entry field conducive to 
accepting input from the user. The user may then use an 
input device, such as a keyboard, microphone, and the like, 
and specify the desired category of document. 

Proceeding to state 404, the user provides a password, or 
other identifying information, to the document server com- 



puter 104. In one embodiment, the password is entered 
through the same web page through which the user specified 
the category of document. In another embodiment, the 
document server computer 104 may present the user a 

5 different web page which is to be used to provide the 
password. In still another embodiment, the user may not be 
required to provide a password to utilize the document 
comparison facility. In yet another embodiment, certain 
selected categories of documents, such as, by way of 

1Q example, categories of documents containing confidential, 
classified, or sensitive documents, may require a password 
from the user. In state 406, the user requests a list of 
documents contained in the specified category of document. 
Again, this may be requested through either the same web 
page through which the user selected the category of 

15 document, the same web page through which the user 
provided the password, or through still another web page 
altogether. Those of ordinary skill in the art will realize that 
the particular ordering of the states 402, 404, and 406 is not 
critical, and that the aforementioned states may be rear- 

20 ranged in a different order, or even possibly omitted, without 
detracting from the scope of the invention. 

Subsequently, the user is presented a web page containing 
a side-by-side list of documents 210 (see FIG. 2). The 
side-by-side list of documents identifies the documents 

25 contained in the specified category of document. In another 
embodiment, the side-by-side list of documents 210 identi- 
fies the documents which contain differences between one or 
old version of the document and another or new version of 
the same or like document. In still another embodiment, the 

30 user may be permitted to specify a threshold level of 
difference in order for a document to be included in the 
side-by-side list of documents 210. As an example, the user 
may be permitted to specify that documents containing an 
arbitrary number of different words, such as ten (10), 

35 between the old version and the new version are to be 
included in the side-by-side list of documents 210. In yet 
another embodiment, the threshold level of difference may 
advantageously be specified in units such as, by way of 
example, nibbles, bytes, pixels, and units of time such as 

40 seconds or minutes for electronic content such as video. One 
list identifies the old version of the documents (old docu- 
ment list). The other list identifies the new version of the 
documents (new document list). 
Proceeding to state 408, the user selects a document from 

45 the old document list. The user may then advantageously 
select a document from the new document list in state 410. 
The user then submits the selected documents to the docu- 
ment server computer 104 for comparison in state 412. In 
one embodiment, the user may select any document from the 

50 old document list and any other document from the new 
document list. The document server computer 104 advan- 
tageously compares the submitted documents and appropri- 
ately identifies the segments contained within the submitted 
documents containing changes or differences. Typically, the 

55 user selects like documents, for example, an old version of 
a particular document from the old document list and the 
new version of the same document from the new document 
list, and submits the specified documents for comparison. 
Moreover, the user may advantageously select one 

60 document, either from the old document list or the new 
document list, and submit the selected document for com- 
parison. In this case, the document server computer 104 
need not compare the submitted document in order to 
identify the segments contained in the submitted document 

65 as being changed. 

Subsequently, the user is presented a web page containing 
a side-by-side list of segments 212 (see FIG. 2). The 
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side-by-side list of segments 212 identifies the segments according incremented in sequence. As an example, the old 
contained within the specified documents which contain version of the document may include segments "Section A," 
differences or changes. One list identifies the segments of "Section B," and "Section C." In the new version of the 
the old version of the document containing changes (old document, an entirely new segment, titled "Section A," may 
section list). The other list identifies the segments of the new s have been inserted before the existing "Section A" from the 
version of the document containing changes (new section old version of the document. The "Section A" from the old 
list). As an example, a "section j" present in the specified version of the document may have been renamed "Section 
document from the old document list and a comparable or B" in the new version of the document. Likewise, the old 
like "section j" present in the specified document from the "Section B" and "Section C may have been respectively 
new document list may contain differences or changes. In 10 renamed "Section C" and "Section D" in the new version of 
this instance, the side -by-side list of segments 212 can the document. Then, the document server 104 will lists the 
specify "section f in both the old sectioa list and the new segments in the old version of the document and the seg- 
section list. ments in the new version of the document as containing 

In one embodiment, if a segment in one document does changes between the old and new versions of the document, 
not have a comparable segment in the other document, the 15 In this instance, the user may benefit from requesting a 
segment that is present will be displayed in the appropriate comparison of "Section A" from the old section list and 
segment list. For example, if the specified document from "Section B" from the new section list to determine the 
the old document list contains a "section k" which is not presence of substantive changes beyond the change in 
present in the specified document from the new document segment headings. Moreover, the user may advantageously 
list, then "section k" appears as an entry in the old segment 20 select one segment, either from the old segment list or the 
list and not the new segment list. Likewise, if the specified new segment list, and submit the selected segment for 
document from the new document list contains a "section comparison. In this case, the document server computer 104 
m" which is not present in the specified document from the need not compare the selected segment and the selected 
old document list, then "section m" appears as an entry in the segment's contents are identified as being changed, 
new segment list and not the old segment list. Analogously, 25 Proceeding to state 420, the user is presented a web page 
if the user submits one document for comparison, the containing a side-by-side display of segment contents 214 
segments contained in the specified document appear in the (see FIG. 2). The side-by-side display of segment contents 
appropriate segment list as containing differences. 214 displays the contents of the one or more segments 

In one embodiment, the segment list may advantageously previously submitted for comparison. The contents of the 
identify segments of the document containing differences or 30 segment from the old segment list is displayed in one display 
changes from like or comparable documents. The segments while the contents of the segment from the new segment list 
may advantageously be any identifiable portion of the docu- is displayed in the other, second display. In the display, the 
ment such as, by way of example, sections (including components of the segment containing differences or 
subsections, sub-subsections, etc.), chapters, pages, arbitrary changes between the submitted segments are appropriately 
number of paragraphs, sentences, words, nibbles, bytes or 35 identified. In the case of a textual document, the component 
any other unit of storage, topics, acts, scenes, units of time, may advantageously be a sentence or a paragraph. In the 
and the like. case of a graphical document, the component may be a 

Proceeding to state 414, the user selects a segment from number of pixels or lines of display. In the case of a video 
the old segment list. The user may advantageously select a display, the component may be a number of frames. Those 
segment from the new segment list in state 416. The user 40 of ordinary skill in the art will realize that the selection of the 
then submits the selected segments to the document server component is arbitrary and may differ depending on the 
computer 104 for comparison in state 418. In one document type. The components may be appropriately dis- 
embodiment, the user may select any segment from the old tinguished in the display by displaying in a different color, 
segment list and any segment from the new segment list. font, type size, intensity, contrast, and the like. Within the 
Advantageously, the user selects segments with the same 45 components, the actual subcomponent, such as, by way of 
segment identifier or segment label. The document server example, the word, letter, byte, bit, or pixel, which is 
computer 104 advantageously compares the submitted seg- different between the two displayed segments may be further 
ments and appropriately identifies the changes or differ- distinguished. Having viewed the differences between the 
ences. one or more submitted segments, the document comparison 

Typically, the user selects like segments, for example, an 50 process proceeds to end state 422. 
old version of a particular segment from the old segment list The web server 204 provides access to the communication 
and the new version of the same segment from the new medium 106 and delivers the plurality of web pages stored 
segment list, and submits the specified segments for com- on the document server computer 104 to the one or more 
parison. This may advantageously be implemented auto- user computers 102. The plurality of web pages facilitate the 
matically such that, when the user selects a segment from 55 comparison of documents stored in the document database 
either the old segment list or the new segment list, the 208. The web server 204 may be developed using standard 
corresponding segment in the other list is automatically web server software applications such as, by way of 
selected. Alternatively, the user can select one segment from example, public domain software from NCSA and Apache, 
the old segment list and a different segment from the new and commercial packages such as Netscape's Internet Server 
segment list for comparison by the document server com- 60 software, Microsoft's Internet Server software, and the like, 
puter 104. This may be appropriate in an instance where the These web pages are accessible by users executing a stan- 
userknewof an insertion of a new document segment in the dard browser on the user computer 102. In another 
new version of the document. Insertion of this type may also embodiment, the functionality providing access to, and 
be identified by a program, such as, by way of example, a delivery of, the plurality of web pages may be developed 
compare program, executing on the document server 65 using a proprietary or non-standard software application. In 
computer, and indicated appropriately. The segments this case, the user can execute a comparable software 
appearing after the newly inserted segment may have been program, capable of interfacing to the proprietary or non- 
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standard software executing on the document server com- embodiment, each file may advantageously correspond to a 

puter 104, on the user computer 102 to access the web pages segment contained in a document. 

on the document server computer 104. Certain documents contain segment delimiters such as 

The interface module 206 performs the requested docu- chapters, sections, and subsections. In one embodiment, the 

ment comparison and facilitates communication between the 5 smallest unit, such as a section or subsection, containing text 

web server 204 and the document database 208. For ma y te considered a segment and stored in a separate file, 

example, the document server computer 104, in processing In anothcr embodiment, a section may be considered a 

a request to compare documents, receives the request W and each section of the document, including all 

through the web server 204. The web server 204 extracts the subsections contained within the section may be stored in 

oecesiary information, and this information is advanta- 10 "F™ fi^ Other documents may not contain segment 

, 7 " , ' t . j | *yr\£. a _ r delimiters, but may be continuous in form. In this instance, 

geously processed by the interface module 206. As part of ^ document ^ computer 104 may create artificial 

the processing, the interface module 206 accesses the docu- m lhe process of storing thc documenl ^ tne 

ments and other data stored in the document database 208. document database 208. For example, for a text document, 

Furthermore, the interface module 206 performs necessary twenty-six (or any other arbitrary number) lines of text may 

operations, such as, by way of example, comparing the is be considered a segment. For a video document, an arbitrary 

document contents and presenting some or all of the docu- number of frames may be considered a segment. In still 

ment contents to the user through the web server 204, on the another embodiment, the document server computer 104 

document contents and other data retrieved from the docu- may advantageously contain program logic capable of pars- 

ment database 208 in the manner disclosed herein. ing and the document contents and subsequently generating 

In one embodiment, the interface module 206 uses the 20 segments, and titles or segment headings for the created 

Common Gateway Interface (CGI) protocol to process the segments, based on the program's interpretation of the 

information gathered from, and presented to, the user document contents. As an example, the program logic may 

through the web server 204. The other operations, such as, advantageously parse the contents of a video document and 

by way of example, comparing the document contents and create segments and segment headings based upon the 

identifying the different or changed contents, performed by 25 program's perceived interpretation of the video document, 

the interface module 206 as disclosed herein may advanta- Those of ordinary skill in the art will realize that the actual 

geously be implemented using scripting languages, such as number files containing the document contents can vary 

Unix shell (sh, ksh, or bash) and JavaScript, and other based on the determination of what a segment is for a 

standard programming languages such as C and C++. particular document during the storing of the document in 

Furthermore, the interface functionality enabling the inter- 30 the document database 208. 

face module 206 to access the document database 208 may In another embodiment, more layers may be present in the 

be implemented using the application language suited for the directory tree 90. For example, the "Standards and Recom- 

particular document database 208. Those of ordinary skill in mended Practices" document may be composed of five 

the art will realize that the selection of the particular volumes, "VOL1" through "VOL5." In this instance, the 

software language is arbitrary, and that, any software lan- 35 node "Standards and Recommended Practices" may advan- 

guage capable of implementing the functions and features tageously contain five directories or nodes representing the 

described herein may be used without detracting from the five volumes. Each volume or node can then contain the 

scope of the invention. directories "OLD" and "NEW." 

The document database 208 is a repository for the docu- In still another embodiment, more directories may be 

ments stored on the document server computer 104. In one 40 present depending on the number of versions of the docu- 

embodiment, the document database 208 utilizes a hierar- ment that are stored on the document database 208. As an 

chical file system, such as the Unix file system, in imple- example, the document server computer 104 may advanta- 

menting the document repository. The structure of the hier- geously store all the PTO documents as one of the category 

archical file system facilitates the storage of the electronic of documents in the document database 208. The "PTO" 

contents in one or more hierarchies or levels. As is generally 45 category of documents may consist of hundreds of docu- 

illustrated by the document tree 90 in FIG. 9, at the top level ments. These documents may include, for example, the 

is the root directory. Below the root directory is a directory following documents: "Manual of Patent Examining Proce- 

containing the one or more categories of documents. Each dures" ("MPEP") and "Trademark Manual of Examining 

category of document is an element or node in this level. Procedures" ("TMEP"). Furthermore, each documenl may 
Below each category of document may be one or more 50 be very large, consisting of hundreds or thousands of pages, 

directories representing the documents contained within the Additionally, the three most recent editions of the MPEP 

particular category of documents. Each document is an may be stored in the documenl database 208. The document 

element or node in this level. For example, as is illustrated server computer 104 may advantageously permit users, such 

in FIG. 9, the category of document "FAA" may be one node as patent attorneys, to compare the three stored editions of 

in the categories of documents level and may contain 55 the MPEP. The edition identifier, such as, by way of 

hundreds of documents titled "DOC1" through "DOCN" example, the edition number, the month and year 

and including the "Standards and Recommended Practices" designation, or a combination of both, may be used to 

document and the "Digital Voice Recorder" document. As is distinguish the document versions. For example, the three 

also illustrated in FIG. 9, the category of document "DOD" editions of the documents may be appropriately identified by 
may be another node in the category of documents level and 60 the text strings "/PTO/MPEP/FIFTHEDITION," "/PTO/ 

may include the documents "DOC1" to "DOCX." M PEP/SIXTH EDI TI ON ," and "/PTO/MPEP/ 

Below each document directory is an "OLD" directory SEVENTHEDinON." Segments of each document may 
and a "NEW" directory. The "OLD" directory contains the advantageously be the actual sections and subsections con- 
old version of the document and the "NEW" directory laining substantial data and appearing in the respective 
contains the new version of the particular document. Within 65 MPEP edition. 

the "OLD" and "NEW" directory are one or more files A patent attorney interested in quickly identifying the 

containing the contents of the respective document. In one differences between the three editions of the MPEP may then 
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use a user computer 102 and remotely access the document 
server computer 104. The patent attorney can then request to 
compare the "PTO" category of document. Upon receiving 
a listing of documents contained in the requested "PTO" 
category of document, the patent attorney may advanta- 5 
geously select the MPEP document for comparison by the 
document server computer 104. The document server com- 
puter 104 may advantageously list the segments contained in 
each of the three editions of the MPEP which contain 
differences in a side-by-side-by-side display. In one 10 
embodiment, if one segment in one edition of the MPEP is 
different from one other comparable segment in either of the 
other two editions of the MPEP, that segment identifier is 
listed in the side-by-side-by-side display. In another 
embodiment, a segment in one edition of the MPEP must be 15 
different from the other two comparable segments in the 
other two editions of the MPEP in order for the segment 
identifier to be listed in the side-by-side -by-side display. The 
patent attorney may then advantageously select one segment 
identifier from each of the three lists for comparison by the 20 
document server computer 104. The document server com- 
puter 104 may then compare the contents of the selected 
segments and display the segment contents in a side-by- 
side-by-side display appropriately distinguishing the 
changed segment contents. 

In another alternative embodiment, the document server 
computer 104 may list the three editions of the MPEP in a 
web page and request the user to select two of the three 
editions of the MPEP for comparison. The patent attorney 
may then select two editions for comparison by the docu- 
ment server computer 104. Subsequently, the two specified 
editions of the MPEP can be compared by the document 
server computer 104. In still another alternative 
embodiment, the MPEP document may advantageously 
include both editions and revisions of the MPEP. In this 
instance, differing versions of the document may be identi- 
fied by, for example, a combination of the edition number, 
the month and year designation, and the revision number. 
Those of ordinary skill in the art will realize that the version 
indicators, such as the "OLD" and "NEW" directories, may 
be located in another hierarchy or level in the document tree 
90 without detracting from the scope of the invention. 

A directory or file present in one directory does not 
necessarily have to be present in another directory. For 
example, a "SECTION B" may be present in the "OLD" 
directory for the "Standards and Recommended Practices" 
document but not present in the "NEW" directory for the 
particular document. In this instance, the "SECTION B" 
may have been deleted in going from the old version of the 
document to the new version of the document. 

In another embodiment, document database 208 may be 
implemented with Structured Query Language (SQL) code. 
SQL is a relational database language standardized by the 
International Standards Organization (ISO). The document 
database 208 can be implemented utilizing any number of 55 
commercially available database products such as, by way 
of example, Microsoft® Access and the like. In still another 
embodiment, the document database 208 may conform to 
any database standard, or may even conform to a non- 
standard, private specification. The hierarchical structure of 60 
the document database 208 may be implemented using the 
selected database. In still another embodiment, the docu- 
ments may be stored in the document database 208 in units 
of storage recognized by the particular database, and the 
contents of the units may be identified, retrieved, compared, 65 
modified, and listed in order to facilitate the hierarchical 
comparison of the electronic contents as disclosed herein. 
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One embodiment of the interaction between the compo- 
nents of the document server computer 104, in particular the 
web server 204, the interface module 206, and the document 
database 208, in processing a document comparison request 
is generally illustrated in FIG. 5. Beginning in a start state 
500, the document server computer 104 receives a user 
request to perform a document comparison. In particular the 
document server computer 104 receives the requested cat- 
egory of document to compare and the user's identifying 
information such as, by way of example, a password. 

Proceeding to state 504, the user's identifying information 
is validated to ensure that the user is authorized to access the 
information contained in the specified category of docu- 
ment. In one embodiment, a data record may advantageously 
be used to maintain a list of users authorized to access the 
one or more categories of documents stored in the document 
server computer 104. The document server computer 104 
can locate the data record for the particular category of 
document specified by the user and verify that the received 
user identifying information is found in the list of authorized 
users. By way of example, the user may specify "DOD" as 
the category of document and submit a password. Upon 
receipt of this request, the document server computer 104 
can locate the "DOD" data record and determine if a 
password is required for access. If no password is required, 
then access is granted. If a password is required, then the 
"DOD" data record is searched to locate the submitted 
password. If the submitted password is not found in the 
"DOD" data record, an error message is displayed on the 
user computer 102 in state 506 and the document server 
computer 104 proceeds to end state 522. 

Continuing the above DOD example, if the "DOD" cat- 
egory of document is not password protected, or the user 
submitted password is found in the "DOD" data record, the 
document server computer 104 lists the documents con- 
tained in the "DOD" category of document in a side-by-side 
display of list of documents in state 508. The old version of 
the documents contained in the requested category of docu- 
ments may advantageously be listed in one side of the 
side-by-side display while the other side of the side-by-side 
display may advantageously list the new version of the 
documents contained in the specified category of docu- 
ments. In another embodiment, a comparison is performed 
between the old version and new version of the documents 
contained in the specified category of document and the 
documents containing differences are displayed in the side- 
by-side display. With reference to the directory tree 90 (see 
FIG. 9), a check of the time stamp or the size of the directory 
for the directory nodes representing the documents can be 
performed to determine the documents which have changed 
between the old version and new version of the documents. 

For the DOD example, with reference to the previously 
disclosed document tree 90 (see FIG. 9), each document in 
the "DOD" category of documents is listed in the side -by- 
side display. One embodiment of the side-by-side display of 
the list of documents is generally illustrated in FIG. 6. The 
old document versions may advantageously be presented in 
one list and the new document versions may advantageously 
be presented in another list. Furthermore, the document lists 
may be presented in a scrollable display. Continuing the 
DOD example, the side-by-side display can list the docu- 
ments "DOD/DOC1/OLD" through "/DOD/DOCX/OLD" 
in the old list while listing the documents "DOD/DOC1/ 
NEW" through "/DOD/DOCX/NEW" in the new list. From 
this screen, the user may advantageously use a pointing 
device, such as a mouse or the like, and select the documents 
7D0D/D0C1/0LD" and "/DOD/DOC1/NEW" and subse- 
quently submit the documents for further comparison. 
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Proceeding to slate 510, the document server computer components, such as, by way of example, sentences, 
104 receives the user submitted documents for further paragraphs, images, photographs, and video frames, con- 
comparison. The submitted documents are compared in state taining differences between the segments are identified. The 
512. In one embodiment, each document segment in the old subcomponents, such as, by way of example, words, letters, 
version is compared with the comparable or like segment in 5 pixels, and frames, are further identified, 
the new version. In the DOD example, the contents of Proceeding to state 520, the document sever computer 
"/DOD/DOCl/OLD/SECr is compared to the contents of 104 displays l *Je contents of the particular segments in a 
"/DOD/DOCl/NEW/SECI." The identified segments are side-by-side display as generally iUustraied in FIG. 8 Each 
compared until a first difference is detected or until the J* a * «f to indicate the segment being 
segments compare successfully. If a difference is detected, io displayed Hie contend of one requested segment are dis- 

& . . r . ; , , , played on one side and the contents of the other requested 

the particular segments are identified accordingly as con- ^ men{ afe ^ { d oQ ^ ^ ^ If om Qne * 

taming differences or changes, and the particular segments wafi for ^^ison in state 516, then only** 

are no longer compared. In another embodiment, the time segment's contents are displayed, and one of the displays 

stamp, or the file size, of the files representing similar may advantageously be empty. Furthermore, the displays 

segments are compared in determining whether the segment is may be scrollable to provide the user ease of navigation in 

has changed from one version to the other version. This viewing the displayed contents. In another embodiment, the 

comparison process continues until all document segments contents of the particular segments may be displayed in one 

are compared. display or screen. In the display, the differences in the 

In another embodiment, the segment comparison may be content may advantageously be indicated by methods such 

continued until a user specified threshold difference level is 20 as, by way of example, underlining, "redlining," or differing 

satisfied. As an example, the user may specify a threshold of colors. 

10 words which need to be different before a segment is In one embodiment, the previously identified segment 
identified as containing modifications or changes between components and subcomponents which have changed 
the versions. If a particular segment contains differences between the specified segments are distinguished from the 
satisfying the threshold level, the particular segment is 25 other segment contents for clarity and ease of identification, 
identified as containing differences or changes. In still For example, differing colors, shading, or dashed outlines, as 
another embodiment, the user can specify a particular word indicated by hashes in FIG. 8, may be used to further 
or phrase as the threshold difference level. In this instance, distinguish the components containing differences from the 
segments containing an addition/deletion/change to the rest of the segment contents. Lighter dashed lines or 
specified word or phrase between the old version and the 30 x-hatching, for example, would be suitable for image corn- 
new version may advantageously be identified as containing parison to identify areas of change in the side-by-side 
differences or changes. If either the old version or new images. The individual subcomponents which are different 
version of a particular segment is nonexistent, then the may advantageously be further distinguished by underlines, 
existing segment is identified as containing differences or Continuing the DOD example, the contents of "/DOD/ 
changes. 35 DOC1/OLD/4.1.2" are displayed on the left in the side-by- 
The document segments containing differences between side display (see FIG. 8). The contents of "/DOD/DOC1/ 
the old document version and the new document version are NEW/4.1.2" are displayed on the right in the side-by-side 
presented to the user in a side-by-side display in state 514. display. By way of example, "/DOD/DOC1/OLD/4.1.2" and 
One embodiment of the side-by-side display of the list of "/DOD/DOC1 /NEW/4. 1.2" relate to tank operations. The 
segments containing differences between differing versions 40 component identified as containing differences between the 
of the segments within a document is generally illustrated in submitted segments "DOD/DOC1/OLD/4.1.2" and "/DOD/ 
FIG. 7. The old version of the identified segments may DOC1/NEW/4.1.2," is the text which starts with "b)" and 
advantageously be presented in one list and the new version which is indicated by hashes in FIG. 8. The subcomponents 
of the identified segments may advantageously be presented of the segments that are different between the submitted 
in another list. The segment lists may be presented in a 45 segments "/DOD/DOC1/OLD/4.1.2" and "DOD/DOC1/ 
scrollable display. Continuing the DOD example, the com- NEW/4.1.2" are further indicated by underlines in FIG. 8. 
parison of the requested documents "/DOD/DOC1/OLD" The underlines indicate that the subcomponent "THE" has 
and "/DOD/DOC1/NEW" may have identified the segments been changed to "MTHE", and the subcomponent "4.1.4;" 
listed in the side-by-side segment list illustrated in FIG. 7 as has been changed to "4.1.5;". 

containing differences between the old version and the new 50 In another embodiment, the electronic contents stored in 
version. Utilizing this list, the user can quickly identify the the document database 208 may include voice information, 
document segments containing differences or changes In comparing and presenting the differences between differ- 
between the two versions. From this screen, the user can ing versions of the voice information, the document server 
identify and submit one or more segments to the document computer 104 can transform the voice information into 
server computer 104 for further comparison. For example, 55 textual form and present the textual form of the voice 
the user can select and submit the segments "/DOD/DOC1/ information, with the appropriate differences indicated as 
OLD/4.1.2" and "/DOD/DOC1/NEW/4.1.2" to the docu- disclosed herein, to the user in the side-by-side display, 
ment server computer 104, whereupon the contents of the Furthermore, the voice information may advantageously be 
contents of the submitted segments are advantageously stored in the textual form in the document database 208. 
compared and displayed to the user. Note that the labels for 60 Thus, the voice information may advantageously be sepa- 
the document segments could be any label useful to the user. rated and stored in logical segments. In still another 
Thus, for example, the text of the section heading could be embodiment, the voice information may be stored in the 
included. document database 208 as sound signals, and these signals 
Proceeding to state 516, the document server computer can subsequently be separated into logical segments, com- 
104 receives the user submitted segments for further com- 65 pared for differences between versions, and presented to the 
parison. The specified segments are retrieved from the user through sound devices attached to the user computer 
document database 208 and compared in state 518. Segment 102. 
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In still another embodiment, the electronic contents stored 
in the document database 208 may include video informa- 
tion. The video information may be separated into logical 
segments such as, by way of example, different scenes, 
different topics, a predetermined length of time, or a prede- 
termined number of frames. In comparing and presenting the 
differences between differing versions of the video 
information, the document server computer 104 can com- 
pare the appropriate segments of the video and determine the 
segments containing differences. The segments containing 
differences can be presented to the user in a side-by-side 
display as disclosed herein. The user can then select one or 
more video segments for further comparison by the docu- 
ment server computer 104. The document server computer 
104 can identify components within the video segments 
containing differences between the submitted video seg- 
ments. The submitted video segments can then be played in 
the side-by-side display with one video segment playing in 
one display and the other video segment playing on the other 
display. The video segments can be played simultaneously in 
the side-by-side display. Alternatively, the user may advan- 
tageously control the playing of the individual video seg- 
ments. Furthermore, the when a component of the video 
segment containing differences is playing, an appropriate 
indicator, such as a light or banner message, may appear on 
the display alerting the user to the fact that the video 
segment being displayed on one or both of the side-by-side 
displays is a component of the video segment containing 
differences. A similar approach could be taken for segments 
of a recorded sound document such as music. 

FIG. 3 illustrates one embodiment of the flow of infor- 
mation between a user computer 102 and the document 
server computer 104 when the user accesses the web pages 
stored on the document server computer 104 in requesting a 
document comparison. In event A, the user utilizes a 
browser 202 executing on his or her user computer 102 and 
accesses the document server computer 104 through the 
communication medium 106. In particular, through a web 
page stored on the document server computer 104, the user 
submits information including a category of document to 
compare and a user password. In event B, the document 
server computer 104 verifies the user submitted information 
and displays a list of documents contained in the user 
specified category of document. The documents may advan- 
tageously be displayed to the user in a side-by-side display 
through a web page displayed on the user computer 102. 
One side of the side-by-side display may advantageously list 
an old version of the documents (old document list) and the 
other side of the side-by-side display may advantageously 
list a new version of the documents (new document list). 
Through this web page, the user can select one or more 
documents for further comparison by the document server 
computer 104 in event C. For example, the user may select 
one document from the old document list and another 
document from the new document list. The selected docu- 
ments can be alike or substantially similar. Alternatively, the 
selected documents can be distinct and different 

In event D, the document server computer 104 compares 
the user specified documents. In particular, the document 
comparison is performed by identifying segments within the 
documents, and comparing a segment in one document with 
a like or substantially similar segment in the other document. 
Segments containing differences or changes are appropri- 
ately identified and displayed to the user in a side-by-side 
display through a web page displayed on the user computer 
102. The side-by-side display lists the segments identified as 
containing differences between the submitted documents. 



0,620 Bl 

22 

For example, the identified segments from the document 
specified in the old document list may advantageously be 
listed in one of the side-by-side displays. The identified 
segments from the document specified in the new document 

5 list may advantageously be listed in the other of the side- 
by-side display. 

Through this web page, the user advantageously selects 
one or more identified segments for further comparison by 
the document server computer 104 in event E. For example, 

10 the user may select a segment from the old version of the 
document and a substantially same or like segment from the 
new version of the document. In another embodiment, the 
user can select any one segment from one of the side-by-side 
displays and any other segment from the other side-by-side 

15 display. In this instance, the selected segments do not have 
to be substantially similar. In event F, the document server 
computer 104 compares the user submitted segments. The 
contents of the segments are compared and the differences or 
changes between the submitted segments are appropriately 

20 identified. Furthermore, the area or region of the segment 
containing the identified differences or changes are also 
identified. The document server computer 104 then displays 
the contents of the user selected segments in a side-by-side 
display through a web page displayed on the user computer 

25 102. In the side-by-side display, the identified differences or 
changes within the segments, as well as the identified region 
or area of the segment containing the differences or changes, 
are displayed in a contrasting manner for easy identification 
by the user, 

30 Subsequent to viewing the contents of the requested 
segments, the user can re-access the side -by-side list of 
segments containing differences and select one or more 
different segments from the side -by-side display for com- 
parison by the document server computer 104. In one 

35 embodiment, the plurality of web pages facilitating the 
comparison of documents can be implemented with a "pre- 
vious page" button well known to those of ordinary skill in 
the art. The user can use a pointing device, such as a mouse 
or the like, and click on the "previous page" button in the 

40 web page displaying the contents of the selected segments to 
access the side-by-side list of segments containing differ- 
ences. The user may then advantageously select one or more 
segments for comparison by the document server computer 
104. This process may be repeated until the user has selected 

45 and viewed the differences contained in the desired seg- 
ments. Thus, the user may, but is not forced to, re -specify the 
category of document, or the documents of interest, after 
each comparison of the requested segment by the document 
server computer 104. 

50 In another embodiment, the web pages facilitating the 
comparison of documents as disclosed herein may advan- 
tageously include a text entry area. The user may then 
specify one or more documents in the text entry area. 
Alternatively, the user may also specify one or more seg- 

55 ments in the text entry area. The document server computer 
104 may advantageously receive the one or more documents 
or the one or more sections entered by the user in the text 
entry area and subsequently perform the appropriate com- 
parison. Thus, a more knowledgeable user may, but is not 

60 obliged to, make selections from the sequence of side -by- 
side displays in performing a document comparison. The 
more knowledgeable user may circumvent the sequence of 
making selections through the side -by -side displays by 
specifying the desired documents or segments for com pari- 
es son through the text entry area. 

The invention advantageously performs an efficient com- 
parison of the differing versions of documents contained 
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within a category of documents. The documents are com- 
pared in hierarchies or stages. In the first stage, the document 
contained in the requested category of documents is pre- 
sented. Subsequently, if a particular document is selected for 
further comparison, the document segments are compared 
until a first difference is detected between the different 
versions of substantially similar or like segments. Each 
segment is compared until a first difference between ver- 
sions is detected. If a difference is detected, the segment 
comparison ceases, and the segment is identified as contain- 
ing differences or changes. The identified segments are 
subsequently presented to the user. Only when a particular 
segment is targeted for further comparison is the entire 
segment compared and identified differences accordingly 
displayed along with the rest of the segment contents in a is 
manner as disclosed herein. 

The invention affords the user an efficient comparison 
utility. In performing a comparison of a category of 
document, the user is presented differences in hierarchies or 
stages. The user is first presented with documents contained 
in the specified category of document. Upon designating a 
document for further comparison, the user is presented with 
segments within the document which contain differences. 
Upon selecting a segment for further comparison, the user is 
presented with the contents of the segment with the differ- 
ences appropriately distinguished for ease of identification. 
Thus, the user is able to compare a category of documents 
in an efficient and hierarchical manner. 

As an example, a very large agreement with substantial 
exhibits, such as a contract specifying the operational 
requirements for a newly proposed tank, can be stored on the 
document server computer 104. The agreement may contain 
thousands of pages and may additionally be comprised of a 
multiple number of volumes. Furthermore, each volume can 
contain thousands of sections. Authorized users may advan- 
tageously be permitted to make modifications to the stored 
agreement during, for example, the agreement negotiation 
process. In one embodiment, authorized users provide a 
password to the document server computer 104. If the 
password is authenticated, the user is permitted to make 40 
modifications and changes to the agreement stored on the 
document server computer 104. 

The modifications can be made to a copy of the agree- 
ment. Thus, the agreement without the modifications or 
changes can be one version of the agreement and the 
agreement containing the modifications or changes can be 
another version of the agreement. During this negotiation 
process, the agreement can be subject to review by a 
multiple number of interested parties or users. One part of 
the review can include viewing and analyzing the changes 
proposed or made by one or more parties. For example, two 
or more users can simultaneously access the stored agree- 
ment versions and view and comment on one or more 
changes proposed to a particular section of the agreement. 
This invention allows multiple users to remotely access the 55 
stored agreement and quickly and efficiently identify and 
analyze the changes made by one or more parties to the 
agreement without having to view the contents of the entire 
agreement. Additionally, a user can quickly and easily 
identify changes to the agreement made by other authorized 60 
users. 

For example, a user can request the document server 
computer 104 to perform a document comparison of the 
stored agreement versions. The stored agreement may 
advantageously be considered a category of document. The 65 
user can be presented with the one or more volumes con- 
tained in the agreement. The volumes may advantageously 
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be considered the documents contained in the particular 
category of document. The document server computer 104 
advantageously focuses the user to the volumes that actually 
contain differences or changes. In like fashion, the user can 
request a comparison of a specific volume and a specific 
segment contained in the specified volume. At each phase of 
the comparison, the user is presented with a segment listing 
identifying the segments of the volume containing changes. 
The user can then specify one or more segments for the 
document server computer 104 to compare. The contents of 
the requested segments are displayed and the segment 
contents which are different are further distinguished for 
identification. Consequently, the user does not have to 
browse the thousands of pages contained in the agreement to 
observe a particular change. Moreover, the user is able to 
focus the comparison to one of the multiple volumes, and 
one of the thousands of sections making up a volume, in 
observing and analyzing the particular change to the agree- 
ment. Those of ordinary skill in the art will realize that even 
though this invention is suited for comparing very large 
documents or agreements containing one or more volumes 
and thousands of sections, the invention may also be ben- 
eficially used to rapidly and hierarchically compare smaller 
documents and agreements. 

This invention may be embodied in other specific forms 
without departing from the essential characteristics as 
described herein. The embodiments described above are to 
be considered in all respects as illustrative only and not 
restrictive in any manner. The scope of the invention is 
indicated by the following claims rather than the foregoing 
description. 

What is claimed is: 

1. A method of comparing at least two very large and very 
similar documents to quickly locate and identify differences 
in the documents, wherein the documents include a first 
document and a second document each having a large 
number of segments, each segment having a label and 
including a plurality of characters, said method comprising: 
receiving a request to compare the first document and the 

second document; 
performing a character-by-character comparison of at 
least some of the characters of the first document to at 
least some of the characters of the second document; 
determining whether there are differences between the 

first document and the second document; 
listing only the labels, and no other content, of only those 
segments that contain at least one difference in a first 
display in which the listed segment labels from the first 
document appear in a first display portion of said first 
display and the listed segment labels from the second 
document appear in a second display portion of said 
first display; 

receiving a request to display at least one difference 
between (a) at least one segment of the first document 
having a segment label listed in said first display 
portion of said first display and (b) at least one corre- 
sponding segment of the second document having a 
segment label listed in said second display portion of 
said first display; 

displaying the characters of the at least one segment of the 
first document and the characters of the at least one 
corresponding segment of the second document in a 
second display in which at least some of the characters 
of the at least one segment of the first document appear 
in a first display portion of said second display and at 
least some of the characters of the at least one corre- 
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sponding segment of the second document appear in a 
second display portion of said second display; and 
distinguishing the characters which are different in said 
second display. 

2. The method of claim 1, wherein at least one of said first 
display and said second display comprises a side-by-side 
display. 

3. The method of claim 1, wherein one of the first 
document and the second document comprises a previous 
version of the other of the first document and the second 
document. 

4. The method of claim 1, wherein at least one of the first 
document and the second document comprises contents 
selected from the group consisting of text, graphics, audio, 
video. 

5. The method of claim 1, wherein distinguishing the 
characters which are different comprises a technique 
selected from the group consisting of underlining, holding, 
redlining, shading, cross-hatching, use of a differing color, 
use of a contrasting font, use of dashed outlines. 

6. The method of claim 1, wherein one of the at least one 
segment of the first document and the at least one corre- 
sponding segment of the second document comprises a 
previous version of the other of the at least one segment of 
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document in a second display in which at least some of 
the characters of the at least one segment of the first 
document appear in a first display portion of said 
second display and at least some of the characters of the 
at least one corresponding segment of the second 
document appear in a second display portion of said 
second display, wherein the characters which are dif- 
ferent are distinguished in said second display. 

10. The system of claim 9, wherein at least one of said first 
display and said second display comprises a side-by-side 
display. 

11. The system of claim 9, wherein one of the first 
document and the second document comprises a previous 
version of the other of the first document and the second 
document. 

15 12. The system of claim 9, wherein at least one of the first 
document and the second document comprises contents 
selected from the group consisting of text, graphics, audio, 
video. 

13. The system of claim 9, wherein the characters which 
20 are different are distinguished by a technique selected from 

the group consisting of underlining, holding, redlining, 
shading, cross-hatching, use of a differing color, use of a 
contrasting font, use of dashed outlines. 

14. The system of claim 9, wherein one of the at least one 



the first document and the at least one corresponding seg- 2 5 segment of the first document and the at least one corre- 



ment of the second document. 

7. The method of claim 1, wherein at least one of the first 
document and the second document is over one thousand 
pages in length. 

8. The method of claim 1, wherein at least one of the 
segments comprises a unit selected from the group consist- 
ing of chapter, section, subsection, page, paragraph, 
sentence, line, minute, second, nibble, byte, topic, act, scene, 
frame. 
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sponding segment of the second document comprises a 
previous version of the other of the at least one segment of 
the first document and the at least one corresponding seg- 
ment of the second document. 

15. The system of claim 9, wherein at least one of the first 
document and the second document is over one thousand 
pages in length. 

16. The system of claim 9, wherein said computer com- 
prises a server accessible by a user utilizing a browser. 



rt - , . , 17. The system of claim 9, wherein said computer com- 

9.Adocumentcompansonsyste^ 35 prise s a server remotely accessible by a user via a browser 

least two very large and very similar documents to quickly inning on a ^ computer. 

locate and identify differences in the documents, wherein the 18 -j^ syslem of claim 9( herein at least one of the 
documents include a first document and a second document segments comprises a unit selected from the group consist- 
each having a large number of segments, each segment i ng Q f chapter, section, subsection, page, paragraph, 
having a label and including a plurality of characters, said 40 sentence, line, minute, second, nibble, byte, topic, act, scene, 
document comparison system comprising: 

a computer, said computer comprising a module execut- 
able in said computer wherein said module is config- 
ured to receive a request to compare the first document 
and the second document, said module performs a 45 
cbaracter-by-character comparison of at least some of 
the characters of the first document to at least some of 
the characters of the second document and determines 
whether there are differences between the first docu- 
ment and the second document, said module listing 50 
only the labels, and no other content, of only those 
segments that contain at least one difference in a first 
display in which the listed segment labels from the first 
document appear in a first display portion of said first 
display and the listed segment labels from the second 55 
document appear in a second display portion of said 
first display; 

wherein said module is further configured to receive a 
request to display at least one difference between (a) at 
least one segment of the first document having a 60 
segment label listed in said first display portion of said 
first display and (b) at least one corresponding segment 
of the: second document having a segment label listed 
in said second display portion of said first display, said 
module displaying the characters of the at least one 65 
segment of the first document and the characters of the 
at least one corresponding segment of the second 



frame. 

19. The method of claim 1, wherein each of said segment 
labels is unique within one of said first document and said 
second document. 

20. The system of claim 9, wherein each of said segment 
labels is unique within one of said first document and said 
second document. 

21. A method of comparing at least two very large and 
very similar documents to quickly locate and identify dif- 
ferences in the documents, wherein the documents include a 
first document and a second document each having a large 
number of segments, each segment having a label and 
including a plurality of characters, said method comprising: 

receiving a request to compare the first document and the 
second document; 

performing a characler-by-character comparison of at 
least some of the characters of the first document to at 
least some of the characters of the second document; 

determining whether there are differences between the 
first document and the second document; 

listing only the labels, and no other content, of only those 
segments that contain at least one difference in a first 
display in which the listed segment labels from the first 
document appear in a first display portion of said first 
display and the listed segment labels from the second 
document appear in a second display portion of said 
first display; 
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receiving a request to display at least one difference 
between (a) at least one segment of the first document 
having a segment label listed in said first display 
portion of said first display and (b) at least one corre- 
sponding segment of the second document having a 5 
segment label listed in said second display portion of 
said first display; 
displaying the characters of the at least one segment of the 
first document and the characters of the at least one 
corresponding segment of the second document in a 10 
second display in which at least some of the characters 
of the at least one segment of the first document appear 
in a first display portion of said second display and at 
least some of the characters of the at least one corre- 
sponding segment of the second document appear in a 35 
second display portion of said second display; and 
distinguishing the characters which are different in said 

second display; 
wherein each of said segment labels is unique within one 2Q 

of said first document and said second document. 
22. A document comparison system designed to compare 
at least two very large and very similar documents to quickly 
locate and identify differences in the documents, wherein the 
documents include a first document and a second document 25 
each having a large number of segments, each segment 
having a label and including a plurality of characters, said 
document comparison system comprising: 

a computer, said computer comprising a module execut- 
able in said computer wherein said module is config- 30 
ured to receive a request to compare the first document 
and the second document, said module performs a 
character-by-character comparison of at least some of 
the characters of the first document to at least some of 
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the characters of the second document and determines 
whether there are differences between the first docu- 
ment and the second document, said module listing 
only the labels, and no other content, of only those 
segments that contain at least one difference in a first 
display in which the listed segment labels from the first 
document appear in a first display portion of said first 
display and the listed segment labels from the second 
document appear in a second display portion of said 
first display; 

wherein said module is further configured to receive a 
request to display at least one difference between (a) at 
least one segment of the first document having a 
segment label listed in said first display portion of said 
first display and (b) at least one corresponding segment 
of the second document having a segment label listed 
in said second display portion of said first display, said 
module displaying the characters of the at least one 
segment of the first document and the characters of the 
at least one corresponding segment of the second 
document in a second display in which at least some of 
the characters of the at least one segment of the first 
document appear in a first display portion of said 
second display and at least some of the characters of the 
at least one corresponding segment of the second 
document appear in a second display portion of said 
second display, wherein the characters which are dif- 
ferent are distinguished in said second display; 

wherein each of said segment labels is unique within one 
of said first document and said second document. 

***** 
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